The final CUDA source code.  Three files, one is the kernel file which also contains the final versions of the kernels.  There is another kernel file that contains the supporting device kernels for the chosen  gpu\_send/recv.  The other is the main CUDA file with proper kernel calls and grid layout.  
